The Population Reference Sample, POPRES: a resource for population, disease, and pharmacological genetics research.

نویسندگان

  • Matthew R Nelson
  • Katarzyna Bryc
  • Karen S King
  • Amit Indap
  • Adam R Boyko
  • John Novembre
  • Linda P Briley
  • Yuka Maruyama
  • Dawn M Waterworth
  • Gérard Waeber
  • Peter Vollenweider
  • Jorge R Oksenberg
  • Stephen L Hauser
  • Heide A Stirnadel
  • Jaspal S Kooner
  • John C Chambers
  • Brendan Jones
  • Vincent Mooser
  • Carlos D Bustamante
  • Allen D Roses
  • Daniel K Burns
  • Margaret G Ehm
  • Eric H Lai
چکیده

Technological and scientific advances, stemming in large part from the Human Genome and HapMap projects, have made large-scale, genome-wide investigations feasible and cost effective. These advances have the potential to dramatically impact drug discovery and development by identifying genetic factors that contribute to variation in disease risk as well as drug pharmacokinetics, treatment efficacy, and adverse drug reactions. In spite of the technological advancements, successful application in biomedical research would be limited without access to suitable sample collections. To facilitate exploratory genetics research, we have assembled a DNA resource from a large number of subjects participating in multiple studies throughout the world. This growing resource was initially genotyped with a commercially available genome-wide 500,000 single-nucleotide polymorphism panel. This project includes nearly 6,000 subjects of African-American, East Asian, South Asian, Mexican, and European origin. Seven informative axes of variation identified via principal-component analysis (PCA) of these data confirm the overall integrity of the data and highlight important features of the genetic structure of diverse populations. The potential value of such extensively genotyped collections is illustrated by selection of genetically matched population controls in a genome-wide analysis of abacavir-associated hypersensitivity reaction. We find that matching based on country of origin, identity-by-state distance, and multidimensional PCA do similarly well to control the type I error rate. The genotype and demographic data from this reference sample are freely available through the NCBI database of Genotypes and Phenotypes (dbGaP).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reference Values for Serum Total Cholesterol Concentrations Using Percentile Regression Model: A Population Study in Mashhad

Background and Purpose: Serum total cholesterol (TC) concentrations are affected by several factors including ethnicity, diet, geographic, and environmental determinants, and are related to another disease, including hypothyroidism, and renal and liver disease. It is associated with an increased risk of cardiovascular disease, particularly if associated with high levels of serum low-density lip...

متن کامل

Posterior predictive checks to quantify lack-of-fit in admixture models of latent population structure.

Admixture models are a ubiquitous approach to capture latent population structure in genetic samples. Despite the widespread application of admixture models, little thought has been devoted to the quality of the model fit or the accuracy of the estimates of parameters of interest for a particular study. Here we develop methods for validating admixture models based on posterior predictive checks...

متن کامل

An Investigation on Population Structure and Inbreeding of Sangsari Sheep

The aim of this study was to describe inbreeding and population structure in Sangsari sheep breeding station. For this reason, data from 7028 Sangsari sheep which were collected during 1987-2014 in Sangsari sheep breeding station located near to Damghan city, Semnan province were used. Lambs born during 2010-2014 were considered as reference population. The genetic structure analysis of the pop...

متن کامل

The Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population

Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an asse...

متن کامل

Analysis of rs6725887 in the WD Repeat Protein 12 in Association With Coronary Artery Disease in Iranian Patients

Although genetic variants that affect susceptibility to coronary artery disease (CAD) have been greatly known, a number of these single nucleotide polymorphisms (SNPs) remain to be analyzed in populations with different ethnicities. CAD is influenced by numerous genetic, environmental, and lifestyle factors, and is an important reason for mortality around the globe. In this study, a novel SNP (...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • American journal of human genetics

دوره 83 3  شماره 

صفحات  -

تاریخ انتشار 2008